Applying Rhythm Features to Automatically Assess Non-Native Speech
Abstract
Speech rhythm measurements have been used in only a few previous studies on automated speech assessment, an approach that uses speech recognition technology to judge non-native speakers’ proficiency levels. A major limitation of these studies is that they did not compare rhythm features with the effective non-rhythm features identified over a decade of prior research. In this paper, we extract both non-rhythm and rhythm features and compare how well they predict human-rated proficiency scores. We show that adding rhythm features significantly improves the performance of a scoring model based only on non-rhythm features.
Similar resources
L1 Perceptions of L2 Prosody: The Interplay Between Intonation, Rhythm, and Speech Rate and Their Contribution to Accentedness and Comprehensibility
This study investigates the cumulative effect of (non-)native intonation, rhythm, and speech rate in utterances produced by Spanish learners of Dutch on Dutch native listeners’ perceptions. In order to assess the relative contribution of these language-specific properties to perceived accentedness and comprehensibility, speech produced by Spanish learners of Dutch was manipulated using transpla...
Applying rhythm metrics to non-native spontaneous speech
This study investigates a variety of rhythm metrics on two corpora of non-native spontaneous speech and compares the non-native distributions to values from a corpus of native speech. Several of the metrics are shown to differentiate well between native and non-native speakers and also to correlate moderately with the English proficiency scores assigned to the non-native speech. The ...
Automatic Assessment of Non-Native Prosody for English as L2
We recorded non-native English productions of 55 speakers; a subset of these productions was assessed by 60 native English speakers for their quality with respect to intelligibility, rhythm, etc. Applying multiple linear regression on a large prosodic feature vector (modelling approaches known from the literature as well as generic prosody), we can automatically predict the listeners’ assessments ...
Non-native speech rhyth
This study investigates non-native speech rhythm in German. Three native speakers and 18 learners of German with three different native languages were recorded reading and re-telling a story. Vowel reduction and deletion were analyzed in one particular type of German suffix. Speech rhythm was investigated with the syllable ratio, a new measurement which determines the mean ratio of durational re...
The Automatic Assessment of Non-native Prosody: Combining Classical Prosodic Analysis with Acoustic Modelling
In earlier studies, we employed a large prosodic feature vector to assess the quality of L2 learners’ utterances with respect to sentence melody and rhythm. In this paper, we combine these features with two standard approaches in paralinguistic analysis: (1) features derived from a Gaussian Mixture Model used as Universal Background Model (GMM-UBM), and (2) openSMILE, an open-source toolkit for...